Using just the credit and loyalty card data, identify the most popular locations, and when they are popular. What anomalies do you see? What corrections would you recommend to correct these anomalies? Please limit your answer to 8 images and 300 words.
By observing loyalty and credit card data, we can observe that the top three most popular location during the 2 weeks data are Katerina’s Cafe followed by Hippokampos, and Guy’s Gyros.
As showed in the data table below, both loyalty and credit card data shows a consistent result for the popular location.
Stacked bar below shows which date are the most popular for the these popular place based on loyalty and credit card data, which apparently shows a different results.Based on loyalty card transaction data Katerina’s Cafe was popular on 11 January 2014, with 19 transactions from GASTech employees. Hippokampos has the most transaction on 8 January 2014 and lastly Guy’s Gyros has the most transaction on 15 January 2014. However based on credit card data Katerina’s Cafe is most popular on 6 January 2014, Hippokampos on 16 January 2014 and Guy’s Gyros on 13 January 2014
Interactive bar graph was created below by using credit card dara to observe the patterns of visiting the top three locations in different hours of the day. We can see that most of the GASTech employee visited Katerina’s Cafe and Guy’s Gyros during dinner time, on the other hand Hippokampos is more popular during lunch time, except for weekends.
As we can see above, the number of daily frequency of transactions are different between credit card data and loyalty card data. Further observations was performed and it shows that there are total of 409 un-matched records. These un-matched records might lead to some new clues. Unmatched records can be seen in the table below
To correct this anomaly, we can add vehicle data to our analysis, we might be able to determine some of the transactions based on the employee location.
Add the vehicle data to your analysis of the credit and loyalty card data. How does your assessment of the anomalies in question 1 change based on this new data? What discrepancies between vehicle, credit, and loyalty card data do you find? Please limit your answer to 8 images and 500 words.
Based on employee location, we can figure out the timing range for cc transaction in Bean There Done That, Brewed Awakenings, Jack’s Magical Beans. Even though there are some transaction in Jack’s Magical Beans that is not found in location data.
There are few credit card and loyalty transactions that cannot be traced from the vehicle data. There is a possibility that employee are not using company vehicle to go to a shop or restaurant.
There are some shop that is not shown in the maps below for example Abila Zacharo, Hippokampos, Kalami Kafenion. Therefore, it was challenging to figure out the position of some location. Matching the vehicle data with credit card data will help us to figure out some unknown locations, however there are still some shops that are not detected, one of the example is Daily Dealz.
Reading layer `Abila' from data source
`C:\jovinkahartanto\assignment_distill\MC2\Geospatial'
using driver `ESRI Shapefile'
Simple feature collection with 3290 features and 9 fields
Geometry type: LINESTRING
Dimension: XY
Bounding box: xmin: 24.82401 ymin: 36.04502 xmax: 24.90997 ymax: 36.09492
Geodetic CRS: WGS 84
Some location that can be traced by using credit card data are Abila Zacharo, Hippokampos, Kalami Kafenion and few other shops.
Reading layer `Abila' from data source
`C:\jovinkahartanto\assignment_distill\MC2\Geospatial'
using driver `ESRI Shapefile'
Simple feature collection with 3290 features and 9 fields
Geometry type: LINESTRING
Dimension: XY
Bounding box: xmin: 24.82401 ymin: 36.04502 xmax: 24.90997 ymax: 36.09492
Geodetic CRS: WGS 84